Model-Based Reinforcement Learning using Model Mediator in Dynamic Multi-Agent Environment

نویسندگان

چکیده

Centralised training and decentralised execution (CTDE) is one of the most effective approaches in multiagent reinforcement learning (MARL). However, these CTDE methods still require large amounts interaction with environment, even to reach same performance as very simple heuristic-based algorithms. Although modelbased RL a prominent approach improve sample efficiency, its adaptation multi-agent setting combining existing has not been well studied literature. The few studies only consider settings relaxed restrictions on number agents observable range. In this paper, we where some information about each agent’s observations (e.g. visibility, agents) are changed dynamically. such setting, fundamental challenge how train models that accurately generate complex transitions addition central state, use it for efficient policy learning. We propose model based algorithm novel architecture consisting global local prediction mediator. evaluate our model-based applied an method challenging StarCraft II micromanagement tasks show can learn fewer interactions environment.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Agent Based Dynamic E-Learning Environment

Grids are increasingly being used in applications, one of which is e-learning. As most of business and academic institutions (universities) and training centres around the world have adopted this technology in order to create, deliver and manage their learning materials through the Web, the subject has become the focus of investigate. Still, collaboration between these institutions and centres ...

متن کامل

Using Advice in Model-Based Reinforcement Learning

When a human is mastering a new task, they are usually not limited to exploring the environment, but also avail themselves of advice from other people. In this paper, we consider the use of advice expressed in a formal language to guide exploration in a model-based reinforcement learning algorithm. In contrast to constraints, which can eliminate optimal policies if they are not sound, advice is...

متن کامل

Accelerating Multi-agent Reinforcement Learning with Dynamic Co-learning

We introduce an approach to adaptively identify opportunities to periodically transfer experiences between agents in large-scale, stochastic, homogeneous, multi-agent systems. This algorithm operates in an on-line, distributed manner, using supervisor-directed transfer, leading to more rapid acquisition of appropriate policies in systems with a large number of cooperating reinforcement learning...

متن کامل

Multi-Agent Reinforcement Learning

This thesis presents a novel approach to provide adaptive mechanisms to detect and categorise Flooding-Base DoS (FBDoS) and Flooding-Base DDoS (FBDDoS) attacks. These attacks are generally based on a flood of packets with the intention of overfilling key resources of the target, and today the attacks have the capability to disrupt networks of almost any size. To address this problem we propose ...

متن کامل

Reinforcement Learning: Model-based

Reinforcement learning (RL) refers to a wide range of dierent learning algorithms for improving a behavioral policy on the basis of numerical reward signals that serve as feedback. In its basic form, reinforcement learning bears striking resemblance to ‘operant conditioning’ in psychology and animal learning: actions that are rewarded tend to occur more frequently; actions that are punished ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of The Japanese Society for Artificial Intelligence

سال: 2023

ISSN: ['1346-0714', '1346-8030']

DOI: https://doi.org/10.1527/tjsai.38-5_a-mb1